NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Understanding the Response to Open-Source Dependency Abandonment in the npm Ecosystem

https://doi.org/10.1109/ICSE55347.2025.00004

Miller, Courtney; Jahanshahi, Mahmoud; Mockus, Audris; Vasilescu, Bogdan; Kastner, Christian (April 2025, IEEE)

Free, publicly-accessible full text available April 26, 2026
Elevating Jupyter Notebook Maintenance Tooling by Identifying and Extracting Notebook Structures

https://doi.org/10.1109/ICSME55016.2022.00047

Jiang, Yuan; Kastner, Christian; Zhou, Shurui (October 2022, 2022 IEEE International Conference on Software Maintenance and Evolution (ICSME))

Data analysis is an exploratory, interactive, and often collaborative process. Computational notebooks have become a popular tool to support this process, among others because of their ability to interleave code, narrative text, and results. However, notebooks in practice are often criticized as hard to maintain and being of low code quality, including problems such as unused or duplicated code and out-of-order code execution. Data scientists can benefit from better tool support when maintaining and evolving notebooks. We argue that central to such tool support is identifying the structure of notebooks. We present a lightweight and accurate approach to extract notebook structure and outline several ways such structure can be used to improve maintenance tooling for notebooks, including navigation and finding alternatives.
more » « less
Full Text Available
Feature Interactions on Steroids: On the Composition of ML Models

https://doi.org/10.1109/MS.2021.3134386

Apel, Sven; Kastner, Christian; Kang, Eunsuk (May 2022, IEEE Software)

Full Text Available
Containing Malicious Package Updates in npm with a Lightweight Permission System

https://doi.org/10.1109/ICSE43902.2021.00121

Ferreira, Gabriel; Jia, Limin; Sunshine, Joshua; Kastner, Christian (May 2021, 2021 IEEE/ACM 43rd International Conference on Software Engineering (ICSE))

The large amount of third-party packages available in fast-moving software ecosystems, such as Node.js/npm, enables attackers to compromise applications by pushing malicious updates to their package dependencies. Studying the npm repository, we observed that many packages in the npm repository that are used in Node.js applications perform only simple computations and do not need access to filesystem or network APIs. This offers the opportunity to enforce least-privilege design per package, protecting applications and package dependencies from malicious updates. We propose a lightweight permission system that protects Node.js applications by enforcing package permissions at runtime. We discuss the design space of solutions and show that our system makes a large number of packages much harder to be exploited, almost for free.
more » « less
Full Text Available
How Do Code Changes Evolve in Different Platforms? A Mining-Based Investigation

https://doi.org/10.1109/ICSME.2019.00033

Viggiato, Markos; Oliveira, Johnatan; Figueiredo, Eduardo; Jamshidi, Pooyan; Kastner, Christian (September 2019, Proceedings of the 35th International Conference on Software Maintenance and Evolution (ICSME))

Software developed in different platforms has different characteristics and needs. More specifically, code changes are differently performed in the mobile platform compared to non-mobile platforms (e.g., desktop and Web platforms). Prior works have investigated the differences in specific platforms. However, we still lack a deeper understanding of how code changes evolve across different software platforms. In this paper, we present a study aiming at investigating the frequency of changes and how source code changes, build changes and test changes co-evolve in mobile and non-mobile platforms. We developed linear regression models to explain which factors influence the frequency of changes in different platforms and applied the Apriori algorithm to find types of changes that frequently occur together. Our findings show that non-mobile repositories have a higher number of commits per month compared to mobile and our regression models suggest that being mobile significantly impacts on the number of commits in a negative direction when controlling for confound factors, such as code size. We also found that developers do not usually change source code files together with build files or test files. We argue that our results can provide valuable information for developers on how changes are performed in different platforms so that practices adopted in successful software systems can be followed.
more » « less
Full Text Available
Identifying Redundancies in Fork-based Development

https://doi.org/10.1109/SANER.2019.8668023

Ren, Luyao; Zhou, Shurui; Kastner, Christian; Wasowski, Andrzej (February 2019, 2019 IEEE 26th International Conference on Software Analysis, Evolution and Reengineering (SANER))

Fork-based development is popular and easy to use, but makes it difficult to maintain an overview of the whole community when the number of forks increases. This may lead to redundant development where multiple developers are solving the same problem in parallel without being aware of each other. Redundant development wastes effort for both maintainers and developers. In this paper, we designed an approach to identify redundant code changes in forks as early as possible by extracting clues indicating similarities between code changes, and building a machine learning model to predict redundancies. We evaluated the effectiveness from both the maintainer's and the developer's perspectives. The result shows that we achieve 57-83% precision for detecting duplicate code changes from maintainer's perspective, and we could save developers' effort of 1.9-3.0 commits on average. Also, we show that our approach significantly outperforms existing state-of-art.
more » « less
Full Text Available
Intelligently Transparent Software Ecosystems

https://doi.org/10.1109/MS.2015.156

Herbsleb, James; Kastner, Christian; Bogart, Christopher (January 2016, IEEE Software)

Full Text Available

Search for: All records